Search CORE

25 research outputs found

XML Reconstruction View Selection in XML Databases: Complexity Analysis and Approximation Scheme

Author: A. Balmin
A. Chebotko
D. Florescu
D. Kossmann
H. Gupta
H. Gupta
H.V. Jagadish
M. Atay
M.R. Garey
R. Chirkova
S. Abiteboul
S. Chaudhuri
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Query evaluation in an XML database requires reconstructing XML subtrees rooted at nodes found by an XML query. Since XML subtree reconstruction can be expensive, one approach to improve query response time is to use reconstruction views - materialized XML subtrees of an XML document, whose nodes are frequently accessed by XML queries. For this approach to be efficient, the principal requirement is a framework for view selection. In this work, we are the first to formalize and study the problem of XML reconstruction view selection. The input is a tree

T

, in which every node

i

has a size

c_i

and profit

p_i

, and the size limitation

C

. The target is to find a subset of subtrees rooted at nodes

i_1,\cdots, i_k

respectively such that

c_{i_1}+\cdots +c_{i_k}\le C

, and

p_{i_1}+\cdots +p_{i_k}

is maximal. Furthermore, there is no overlap between any two subtrees selected in the solution. We prove that this problem is NP-hard and present a fully polynomial-time approximation scheme (FPTAS) as a solution

arXiv.org e-Print Archive

Crossref

FunMap: Efficient Execution of Functional Mappings for Knowledge Graph Creation

Author: A Chebotko
A Poggi
B De Meester
D Calvanese
E Rahm
G Gawriljuk
JT den Dunnen
M Lefrançois
MN Mami
O Corcho
S Gupta
S Jozashoori
Publication venue
Publication date: 01/01/2020
Field of study

Data has exponentially grown in the last years, and knowledge graphs constitute powerful formalisms to integrate a myriad of existing data sources. Transformation functions -- specified with function-based mapping languages like FunUL and RML+FnO -- can be applied to overcome interoperability issues across heterogeneous data sources. However, the absence of engines to efficiently execute these mapping languages hinders their global adoption. We propose FunMap, an interpreter of function-based mapping languages; it relies on a set of lossless rewriting rules to push down and materialize the execution of functions in initial steps of knowledge graph creation. Although applicable to any function-based mapping language that supports joins between mapping rules, FunMap feasibility is shown on RML+FnO. FunMap reduces data redundancy, e.g., duplicates and unused attributes, and converts RML+FnO mappings into a set of equivalent rules executable on RML-compliant engines. We evaluate FunMap performance over real-world testbeds from the biomedical domain. The results indicate that FunMap reduces the execution time of RML-compliant engines by up to a factor of 18, furnishing, thus, a scalable solution for knowledge graph creation

arXiv.org e-Print Archive

Crossref

Repositorium für Naturwissenschaften und Technik

Secure abstraction views for scientific workflow provenance querying

Author: A Chebotko
F Fotouhi
Ping Yang
Seunghan Chang
Shiyong Lu
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Conformance test cases for the RDF mapping language (RML)

Author: A Chebotko
Diego Calvanese
J Lehmann
Juan F. Sequeda
K Kyzirakos
M Koubarakis
M-E Vidal
N Konstantinou
R Battle
Stefan Bischof
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Crossref

Ghent University Academic Bibliography

Improving STEM Education in Research: Preliminary Report on the Development of a Computer-Assisted Student-Mentor Research Community

Author: Ammons David
Andoh-Baidoo Francis
Carlson Ralph
Chebotko Artem
Pearson Thomas
Rampersad Joanne
Reilly Christine
Rios David A.
Tomai Emmett
Weimer Amy A.
Weimer Nicholas
Winkle Robert
Publication venue: ScholarWorks @ UTRGV
Publication date: 01/09/2012
Field of study

Research education in STEM disciplines currently suffers from 1) The inability to feasibly collect highly detailed data on both the student’s and mentor’s activities; 2) The lack of tools to assist students and mentors in organizing and managing their research activities and environments; and 3) The inability to correlate a student’s assessment results with their actual research activities. Together these three problems act to impede both the improvement and educational quality of student research experiences. We propose a computer-assisted student-mentor research community as a solution to these problems. Within this community setting, students and their mentors are provided tools to make their work easier, much like a word processor makes writing a letter easier. Through their use of these tools, details of student-mentor activities are automatically recorded in a relational database, without burdening users with the responsibility of archiving data. Equally important, student assessments of outcome can be directly related to student activity, allowing educators to identify practices resulting in successful research experiences. Community tools also facilitate the use of labor-intensive teaching laboratories involving real inquiry-based research. The community structure has the added benefit of allowing students to see, communicate and interact more freely with other students and their projects, thus enriching the student’s research experience. We provide herein a preliminary report on the development and testing of a prototype, student-mentor research community, and present its tools, an assessment of student interest in participating in the community, and discuss its further development into a nationally-available student-mentor research community

Scholarworks@UTRGV Univ. of Texas RioGrande Valley

Answering SPARQL queries over databases under OWL 2 QL entailment regime

Author: A. Chebotko
A. Chortaras
A. Polleres
B. Glimm
C. Lutz
D. Calvanese
E. Sirin
I. Kollia
J. Dolby
J.F. Sequeda
M. König
M. Rodríguez-Muro
R. Angles
R. Kontchakov
U.S. Chakravarthy
Y. Guo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

We present an extension of the ontology-based data access platform Ontop that supports answering SPARQL queries under the OWL 2 QL direct semantics entailment regime for data instances stored in relational databases. On the theoretical side, we show how any input SPARQL query, OWL 2 QL ontology and R2RML mappings can be rewritten to an equivalent SQL query solely over the data. On the practical side, we present initial experimental results demonstrating that by applying the Ontop technologies—the tree-witness query rewriting, T-mappings compiling R2RML mappings with ontology hierarchies, and T-mapping optimisations using SQL expressivity and database integrity constraints—the system produces scalable SQL queries

CiteSeerX

Crossref

Birkbeck Institutional Research Online

SPARQL-to-SQL on Internet of Things Databases and Streams

Author: A Chebotko
B Bishop
C Buil-Aranda
D Le-Phuoc
DF Barbieri
JP Calbimonte
M Rodriguez-Muro
P Barnaghi
T Neumann
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

To realise a semantic Web of Things, the challenge of achieving efficient Resource Description Format (RDF) storage and SPARQL query performance on Internet of Things (IoT) devices with limited resources has to be addressed. State-of-the-art SPARQL-to-SQL engines have been shown to outperform RDF stores on some benchmarks. In this paper, we describe an optimisation to the SPARQL-to-SQL approach, based on a study of time-series IoT data structures, that employs metadata abstraction and efficient translation by reusing existing SPARQL engines to produce Linked Data ‘just-in-time’. We evaluate our approach against RDF stores, state-of-the-art SPARQL-to-SQL engines and streaming SPARQL engines, in the context of IoT data and scenarios. We show that storage efficiency, with succinct row storage, and query performance can be improved from 2 times to 3 orders of magnitude

Crossref

Southampton (e-Prints Soton)

Virtual Infrastructure Optimisation

Author: A Bulut
A Chebotko
A Kritikakou
A Nussbaum
A Papageorgiou
A Taal
C Müller
D Downey
D Kreutz
D Li
G Casale
H Zhou
I Foster
J Wang
MA Rodriguez
N Laranjeiro
N Serrano
P Ingwersen
P Štefanič
PA Laplante
S Abrishami
S Alawneh
S Koulouzis
S Koulouzis
S Taherizadeh
SE Dashti
X Liao
Y Hu
Y Hu
Z Cai
Z Fu
Z Usmani
Z Zhao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Crossref

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Bridging the Semantic Web and NoSQL Worlds: Generic SPARQL Query Translation and Application to MongoDB

Author: A Chebotko
A Schwarte
C Bizer
DE Spanos
F Michel
F Michel
J Pérez
J Sequeda
J Unbehauen
JF Sequeda
M Rodríguez-Muro
M Rodríguez-Muro
N Bikakis
N Bikakis
R Verborgh
T Heath
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 04/01/2019
Field of study

International audienceRDF-based data integration is often hampered by the lack of methods to translate data locked in heterogeneous silos into RDF representations. In this paper, we tackle the challenge of bridging the gap between the Semantic Web and NoSQL worlds, by fostering the development of SPARQL interfaces to heterogeneous databases. To avoid defining yet another SPARQL translation method for each and every database, we propose a two-phase method. Firstly, a SPARQL query is translated into a pivot abstract query. This phase achieves as much of the translation process as possible regardless of the database. We show how optimizations at this abstract level can save subsequent work at the level of a target database query language. Secondly, the abstract query is translated into the query language of a target database, taking into account the specific database capabilities and constraints. We demonstrate the effectiveness of our method with the MongoDB NoSQL document store, such that arbitrary MongoDB documents can be aligned on existing domain ontologies and accessed with SPARQL. Finally, we draw on a real-world use case to report experimental results with respect to the effectiveness and performance of our approach

Crossref

INRIA a CCSD electronic archive server

Efficient handling of SPARQL OPTIONAL for OBDA

Author: A Chebotko
A Poggi
C Bizer
C Galindo-Legaria
D Calvanese
D Calvanese
G Giacomo De
J Pérez
JF Sequeda
M Chaloupka
M Rodriguez-Muro
M Rodríguez-Muro
P Guagliardo
R Elmasri
R Kontchakov
US Chakravarthy
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

OPTIONAL is a key feature in SPARQL for dealing with missing information. While this operator is used extensively, it is also known for its complexity, which can make efficient evaluation of queries with OPTIONAL challenging. We tackle this problem in the Ontology-Based Data Access (OBDA) setting, where the data is stored in a SQL relational database and exposed as a virtual RDF graph by means of an R2RML mapping. We start with a succinct translation of a SPARQL fragment into SQL. It fully respects bag semantics and three-valued logic and relies on the extensive use of the LEFT JOIN operator and COALESCE function. We then propose optimisation techniques for reducing the size and improving the structure of generated SQL queries. Our optimisations capture interactions between JOIN, LEFT JOIN, COALESCE and integrity constraints such as attribute nullability, uniqueness and foreign key constraints. Finally, we empirically verify effectiveness of our techniques on the BSBM OBDA benchmark

Crossref

Birkbeck Institutional Research Online

Kent Academic Repository